Statistical Methods
نویسنده
چکیده
Statistics represents that body of methods by which characteristics of a population are inferred through observations made in a representative sample from that population. Since scientists rarely observe entire populations, sampling and statistical inference are essential. This article first discusses some general principles for the planning of experiments and data visualization. Then, a strong emphasis is put on the choice of appropriate standard statistical models and methods of statistical inference. (1) Standard models (binomial, Poisson, normal) are described. Application of these models to confidence interval estimation and parametric hypothesis testing are also described, including two-sample situations when the purpose is to compare two (or more) populations with respect to their means or variances. (2) Non-parametric inference tests are also described in cases where the data sample distribution is not compatible with standard parametric distributions. (3) Resampling methods using many randomly computer-generated samples are finally introduced for estimating characteristics of a distribution and for statistical inference. The following section deals with methods for processing multivariate data. Methods for dealing with clinical trials are also briefly reviewed. Finally, a last section discusses statistical computer software and guides the reader through a collection of bibliographic references adapted to different levels of expertise and topics. Statistics can be called that body of analytical and computational methods by which characteristics of a population are inferred through observations made in a representative sample from that population. Since scientists rarely observe entire populations, sampling and statistical inference are essential. Although, the objective of statistical methods is to make the process of scientific research as efficient and productive as possible, many scientists and engineers have inadequate training in experimental design and in the proper selection of statistical analyses for experimentally acquired data. John L. Gill [1] states: “...statistical analysis too often has meant the manipulation of ambiguous data by means of dubious methods to solve a problem that has not been defined.” The purpose of this article is to provide readers with definitions and examples of widely used concepts in statistics. This article first discusses some general principles for the planning of experiments and data visualization. Then, since we expect that most readers are not studying this article to learn statistics but instead to find practical methods for analyzing data, a strong emphasis has been put on choice of appropriate standard statistical model and statistical inference methods (parametric, non-parametric, resampling methods) for different types of data. Then, methods for processing multivariate data are briefly reviewed. The section following it deals with clinical trials. Finally, the last section discusses computer software and guides the reader through a collection of bibliographic references adapted to different levels of expertise and topics. DATA SAMPLE AND EXPERIMENTAL DESIGN Any experimental or observational investigation is motivated by a general problem that can be tackled by answering specific questions. Associated with the general problem will be a population. For example, the population can be all human beings. The problem may be to estimate the probability by age bracket for someone to develop lung cancer. Another population may be the full range of responses of a medical device to measure heart pressure and the problem may be to model the noise behavior of this apparatus. Often, experiments aim at comparing two subpopulations and determining if there is a (significant) difference between them. For example, we may compare the frequency occurrence of lung cancer of smokers compared to non-smokers or we may compare the signal to noise ratio generated by two brands of medical devices and determine which brand outperforms the other with respect to this measure. How can representative samples be chosen from such populations? Guided by the list of specific questions, samples will be drawn from specified sub-populations. For example, the study plan might specify that 1000 presently cancer-free persons will be drawn from the greater Los Angeles area. These 1000 persons would be composed of random samples of specified sizes of smokers and non-smokers of varying ages and occupations. Thus, the description of the sampling plan will imply to some extent the nature of the target subpopulation, in this case smoking individuals. Choosing a random sample may not be easy and there are two types of errors associated with choosing representative samples: sampling errors and non-sampling errors. Sampling errors are those errors due to chance variations resulting from sampling a population. For example, in a population of 100,000 individuals, suppose that 100 have a certain genetic trait and in a (random) sample of 10,000, 8 have the trait. The experimenter will estimate that 8/10,000 of the population or 80/100,000 individuals have the trait, and in doing so will have underestimated the actual percentage. Imagine conducting this experiment (i.e., drawing a random sample of 10,000 and examining for the trait) repeatedly. The observed number of sampled individuals having the trait will fluctuate. This phenomenon is called the sampling error. Indeed, if sampling
منابع مشابه
Statistical Methods for the Analysis of Repeated Measurements
The best ebooks about Statistical Methods For The Analysis Of Repeated Measurements that you can get for free here by download this Statistical Methods For The Analysis Of Repeated Measurements and save to your desktop. This ebooks is under topic such as statistical methods for the analysis of repeated measurements statistical methods for the analysis of repeated measurements statistical method...
متن کاملStatistical methods used in articles published by the Journal of Periodontal and Implant Science
PURPOSE The purposes of this study were to assess the trend of use of statistical methods including parametric and nonparametric methods and to evaluate the use of complex statistical methodology in recent periodontal studies. METHODS This study analyzed 123 articles published in the Journal of Periodontal & Implant Science (JPIS) between 2010 and 2014. Frequencies and percentages were calcul...
متن کاملMeta-analysis and The Cochrane Collaboration: 20 years of the Cochrane Statistical Methods Group
The Statistical Methods Group has played a pivotal role in The Cochrane Collaboration over the past 20 years. The Statistical Methods Group has determined the direction of statistical methods used within Cochrane reviews, developed guidance for these methods, provided training, and continued to discuss and consider new and controversial issues in meta-analysis. The contribution of Statistical M...
متن کاملStatistical Methods Statistical Methods#computational Linguistics#machine Learning#stochastic Grammars
" Statistical methods " refers here specifically to statistical methods in computational linguistics. This represents a new body of practice in computational linguistics that has become standard over the last decade.
متن کاملBusiness Sample Survey Measurement on Statistical Thinking and Methods Adoption: the Case of Croatian Small Enterprises
The objective of this research is to investigate attitudes of management in Croatian small enterprises that use statistical methods towards statistical thinking in order to gain an insight into related issues. The research was conducted in 2013 using a web survey with a random sample of 631 Croatian small enterprises, but this paper focuses only on those enterprises that use statistical methods...
متن کاملStatistical Graphics : Developments from Statistical Practice into Statistical
Graphical methods have entered the mainstream of both statistical applications and research in statistical methodology. This development is due in part to the revolution in computing and in part to the stimulating ideas of exploratory data analysis, but the main reason is simply that graphical methods work. Appropriately chosen plots reveal much about the data, and people who start to use them ...
متن کامل